Allocation Strategies for Data-Oriented Architectures
نویسنده
چکیده
Data orientation is a common design principle in distributed data management systems. In contrast to process-oriented or transaction-oriented system designs, dataoriented architectures are based on data locality and function shipping. The tight coupling of data and processing thereon is implemented in different systems in a variety of application scenarios such as data analysis, database-as-a-service, and data management on multiprocessor systems. Data-oriented systems, i.e., systems that implement a data-oriented architecture, bundle data and operations together in tasks which are processed locally on the nodes of the distributed system. Allocation strategies, i.e., methods that decide the mapping from tasks to nodes, are core components in data-oriented systems. Good allocation strategies can lead to balanced systems while bad allocation strategies cause skew in the load and therefore suboptimal application performance and infrastructure utilization. Optimal allocation strategies are hard to find given the complexity of the systems, the complicated interactions of tasks, and the huge solution space. To ensure the scalability of dataoriented systems and to keep them manageable with hundreds of thousands of tasks, thousands of nodes, and dynamic workloads, fast and reliable allocation strategies are mandatory. In this thesis, we develop novel allocation strategies for data-oriented systems based on graph partitioning algorithms. Therefore, we show that systems from different application scenarios with different abstraction levels can be generalized to generic infrastructure and workload descriptions. We use weighted graph representations to model infrastructures with bounded and unbounded, i.e., overcommited, resources and possibly non-linear performance characteristics. Based on our generalized infrastructure and workload model, we formalize the allocation problem, which seeks valid and balanced allocations that minimize communication. Our allocation strategies partition the workload graph using solution heuristics that work with single and multiple vertex weights. Novel extensions to these solution heuristics can be used to balance penalized and secondary graph partition weights. These extensions enable the allocation strategies to handle infrastructures with non-linear performance behavior. On top of the basic algorithms, we propose methods to incorporate heterogeneous infrastructures and to react to changing workloads and infrastructures by incrementally updating the partitioning. We evaluate all components of our allocation strategy algorithms and show their applicability and scalability with synthetic workload graphs. In end-to-end– performance experiments in two actual data-oriented systems, a database-as-aservice system and a database management system for multiprocessor systems, we prove that our allocation strategies outperform alternative state-of-the-art methods.
منابع مشابه
System Dynamics Aproach For Quantitative Risk Allocaion
Allocation of construction risks between clients and their contractors has a significant impact on the total construction costs. This paper presents a system dynamics (SD)-based approach for quantitative risk allocation. Using the proposed SD based approach, all the factors affecting the risk allocation process are modeled. The contractor’s defensive strategies against the one-sided risk alloca...
متن کاملA Reference Architecture for Automation of Inter-Organizational Process-Oriented Collaboration
In today’s competitive, dynamic, and changing business environment, being able to collaborate globally within and beyond the enterprise borders is critical. Inter-Organizational Collaborations (IOCs) have been proposed as a response to the characteristics of highly competitive global business environments. So far, a number of reference models, frameworks, and ad hoc architectures related to som...
متن کاملOptimized Dynamic Allocation Management for ERP Systems and Enterprise Services
To ensure the operability and reliability of large scale Enterprise Resource Planning Systems (ERP) and enterprise services, a peak-load oriented hardware sizing is often used, which results in low average utilization. The evaluation of historical load data revealed that many applications show cyclical resource consumption. The identification of load patterns can be used for static as well as d...
متن کاملOn Flexible Allocation of Index and Temporary Data in Parallel Database Systems
Data placement is a key factor for high performance database systems. This is particularly true for parallel database systems where data allocation must support both I/O parallelism and processing parallelism within complex queries and between independent queries and transactions. Determining an effective data placement is a complex administration problem depending on many parameters including ...
متن کاملEvaluating the effect of using biophilic design and architecture strategies on the improvement of physical and mental health indicators of patients admitted to selected hospitals from the perspective of architectures, patients and patiens’ companions
Introduction: The revolution in hospital design is called biophilic design; because the application of biophilic design in hospitals is one of the cornerstones of the humanization of health centers and hospitals. In this regard, the aim of this study was to Evaluating the effect of using biophilic design and architecture strategies on the improvement of physical and mental health indicators ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016